SemanticScuttle - klotz.me » Tags: architecture+deep learning

Tags: architecture* + deep learning*

0 bookmark(s) - Sort by: Date ↓ / Title /

The Big LLM Architecture Comparison

A detailed comparison of the architectures of recent large language models (LLMs) including DeepSeek-V3, OLMo 2, Gemma 3, Mistral Small 3.1, Llama 4, Qwen3, SmolLM3, and Kimi 2, focusing on key design choices and their impact on performance and efficiency.

2025-07-19 Tags: llm, large language models, deep learning, ai, architecture, deepseek, olmo, gemma, mistral, llama, qwen, smollm, kimi, moe, attention, transformers by klotz
How to Easily Draw Neural Network Architecture Diagrams | by Kenneth Leung | Towards Data Science

2023-04-02 Tags: neural network, architecture, diagrams, kenneth leung, computational neuroscience, neural networks by klotz
Main Types of Neural Networks and its Applications — Tutorial | Towards AI — Multidisciplinary Science Journal

2020-07-14 Tags: neural network, architecture, deep learning, tutorial by klotz
11 Essential Neural Network Architectures, Visualized & Explained

2020-06-29 Tags: neural network, architecture, deep learning by klotz
Paper Dissected: "Attention is All You Need" Explained | Machine Learning Explained

2019-03-21 Tags: deep learning, architecture, attention, lstm, google, transformer, encoder, decoder by klotz
machine learning - How to input multiple categorical variables to Neural Network - Cross Validated

mixing categorical and numerical inputs with embedding

2019-02-16 Tags: neural network, embedding, architecture, cassandra by klotz
“Simple diagrams of convoluted neural networks”

2018-09-16 Tags: deep learning, cnn, visualization, architecture by klotz
How Zendesk Serves TensorFlow Models in Production – Zendesk Engineering – Medium

2017-02-26 Tags: architecture, zendesk, aws, machine learning, tensorflow by klotz

Top of the page

First / Previous / Next / Last / Page 1 of 0

About - Propulsed by SemanticScuttle